Extending test timeout on slow platforms #915

Merged
karianna merged 6 commits into adoptium:master from extend_test_time_limit_on_riscv
Feb 13, 2024

Conversation

adamfarley
Contributor

Some platforms can take over 10 hours to run a full suite of extended.openjdk tests, even when split between 3 parallel machines.

Therefore we should extend the timeout to allow the tests to run to completion.

@adamfarley adamfarley self-assigned this Feb 5, 2024

github-actions bot commented Feb 5, 2024

Thank you for creating a pull request!

Please check out the information below if you have not made a pull request here before (or if you need a reminder how things work).

Code Quality and Contributing Guidelines

If you have not done so already, please familiarise yourself with our Contributing Guidelines and Code Of Conduct, even if you have contributed before.

Tests

GitHub Actions will run a set of jobs against your PR that lint and unit test your changes. Keep an eye out for the results from these on the latest commit you submitted. For more information, please see our testing documentation.

To run the advanced pipeline tests (which execute a set of mock pipelines), an admin must post "run tests" on this PR.
If you are not an admin, please ask for one's attention in #infrastructure on Slack or ping one here.
To run the full set of tests, use "run tests"; to run a subset of tests on specific JDK versions, use "run tests quick 11,21".

@adamfarley
Contributor Author

Fixes #852

Member

@sxa sxa left a comment

Anything further needed for this to have the WIP prefix removed?

@adamfarley
Contributor Author

Just a successful test run. Whether it prevents the issue or not, I want to be sure this hasn't broken anything.

@sxa
Member

sxa commented Feb 6, 2024

> Just a successful test run. Whether it prevents the issue or not, I want to be sure this hasn't broken anything.

While you're waiting - any chance you can adjust the commit message to call out s390x and riscv64 explicitly - it can be really useful when searching commit messages in the future.

These platforms can take over 10 hours to run a full suite of
extended.openjdk tests, even when split between 3 parallel machines.

Therefore we should extend the timeout to allow the tests to run to
completion.

Signed-off-by: Adam Farley <[email protected]>
@adamfarley adamfarley force-pushed the extend_test_time_limit_on_riscv branch from 4e3b21e to 2384ca8 Compare February 6, 2024 16:35
@adamfarley
Contributor Author

> > Just a successful test run. Whether it prevents the issue or not, I want to be sure this hasn't broken anything.
>
> While you're waiting - any chance you can adjust the commit message to call out s390x and riscv64 explicitly - it can be really useful when searching commit messages in the future.

Sure, done.

@karianna
Contributor

karianna commented Feb 7, 2024

@adamfarley Is that a manual test run, or can we merge this since the GH Action checks have passed?

@adamfarley
Contributor Author

adamfarley commented Feb 7, 2024

> @adamfarley Is that a manual test run, or can we merge this since the GH Action checks have passed?

It's a manual test run referred to in the issue here.

Unfortunately the change does not appear to be working. I'm looking into it now.

Signed-off-by: Adam Farley <[email protected]>
@adamfarley
Contributor Author

Ok, here's a rerun with some debug messages:

https://ci.adoptium.net/job/build-scripts/job/jobs/job/jdk22/job/jdk22-linux-s390x-temurin/16/

And here's a rerun using the upstream pipeline job, in case this is one of those things that's outside the scope of the job parameters.

https://ci.adoptium.net/job/build-scripts/job/openjdk22-pipeline/118/

This way we only override the test job's default time limit if we've
set it explicitly.

Signed-off-by: Adam Farley <[email protected]>
@adamfarley
Contributor Author

adamfarley commented Feb 13, 2024

Ok, I've put in the fix for the issue of TIME_LIMIT not being passed downstream, and I've refined the formatting.

If this job passes the variable correctly, we're good to merge.

https://ci.adoptium.net/job/build-scripts/job/jobs/job/jdk22/job/jdk22-linux-s390x-temurin/21/
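
For readers following along, the behaviour described here (only overriding the test job's default time limit when one has been set explicitly) can be sketched roughly as below. This is an illustration in Python, not the change made in this PR: the function name `build_test_job_params`, the shape of the parameter dictionary, and the `JDK_VERSION` key are all hypothetical; only the `TIME_LIMIT` parameter name comes from the discussion.

```python
from typing import Dict, Optional


def build_test_job_params(base_params: Dict[str, str],
                          time_limit_hours: Optional[int] = None) -> Dict[str, str]:
    """Return the parameters to hand to a downstream test job.

    Hypothetical helper: TIME_LIMIT is added only when the caller sets it
    explicitly, so the downstream job otherwise keeps its own default.
    """
    params = dict(base_params)  # copy, so the caller's dict is not mutated
    if time_limit_hours is not None:
        params["TIME_LIMIT"] = str(time_limit_hours)
    return params


# No explicit limit: TIME_LIMIT is not passed, so the job default applies.
default_params = build_test_job_params({"JDK_VERSION": "22"})

# Explicit limit for a slow platform: TIME_LIMIT is passed downstream.
slow_params = build_test_job_params({"JDK_VERSION": "22"}, time_limit_hours=25)
```

Leaving the key out entirely, rather than passing an empty value, is what lets the downstream job fall back to its own default.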

Signed-off-by: Adam Farley <[email protected]>
@adamfarley
Contributor Author

Variable passes correctly. Good to go. Requesting a re-review as the code was changed significantly after the last reviews.

@adamfarley adamfarley changed the title from "WIP: Extending test timeout on slow platforms" to "Extending test timeout on slow platforms" Feb 13, 2024
@sxa
Member

sxa commented Feb 13, 2024

Noting that https://ci.adoptium.net/job/Test_openjdk22_hs_extended.openjdk_s390x_linux_testList_0/13/ took just over 20 hours on test-marist-ubuntu2204-s390x-1. (That job seems to have been run with TIME_LIMIT=30; perhaps this needs to be a touch higher at 25? Or we need to understand why that machine seems slower than the others and resolve that :-) )

https://ci.adoptium.net/job/Test_openjdk22_hs_extended.openjdk_riscv64_linux_testList_0/4/ on RISC-V was similar (just over 20 hours), so while we have a wider range of machine specs on that platform, 25 might be a preferable option there too.

@karianna karianna merged commit 64a5a21 into adoptium:master Feb 13, 2024
7 checks passed
@adamfarley
Contributor Author

adamfarley commented Feb 14, 2024

Okie dokie, will make another PR. - #928
